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ABSTRACT 



We study the cross-correlation of APM and SDSS galaxies with background 
QSOs taken from the 2dF QSO Redshift Survey, and detect a significant (2.8a) anti- 
correlation. The lack of a similar signal between stars in the 2dF survey and our 
• galaxy samples gives us confidence that the anti-correlation is not due to a systematic 

error. The possibility that dust in the foreground galaxies could produce such an anti- 
l/"") , correlation is marginally rejected, at the 2a level through consideration of the colours 

■ of QSOs behind these galaxies. It is possible that a dust model that obscures QSOs 

without reddening them, or preferentially discards reddened QSOs from the 2QZ sam- 
ple, could produce such an anti-correlation, however, such models are at odds with 
1 1 the positive QSO-galaxy correlations found at bright magnitudes by other authors. 

Our detection of a galaxy-QSO anti-correlation is consistent with the predictions 
of statistical lensing theory. When combined with earlier results that have reported 
a positive galaxy-QSO correlation, a consistent and compelling picture emerges that 
spans faint and bright QSO samples showing positive or negative correlations according 
to the QSO N(m) slope. 

We find that galaxies are highly anti-biased on small scales. We consider two mod- 
els that use quite different descriptions of the lensing matter and find they yield con- 
sistent predictions for the strength of galaxy bias on 0.1 /i -1 Mpc scales of b ~ 0.1 (for 
a ACDM cosmology). Whilst the slope of our power-law fit to the QSO-galaxy cross- 
correlation does not allow us to rule out a linear bias parameter, when we compare our 
measurement of b on 100/i _1 kpc scales to independent methods that determine 6^1 
on h~ 1 Mpc scales, we conclude that bias, on these small scales, is scale-dependent. 
These results indicate that there appears to be more mass present, at least on the 
100/i _1 kpc scales probed, than predicted by simple ACDM biasing prescriptions, and 
thus constrains halo occupation models of the galaxy distribution. 

Key words: surveys - quasars, quasars: general, large-scale structure of Universe, 
cosmology: observations, gravitational lensing 



1 INTRODUCTION 

Intervening massive structures can, via weak gravitational 
lensing, alter the density of high-redshift objects detected 
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behind them. Myers et al. (2003) demonstrated the anti- 
correlation between faint QSOs and groups of galaxies, 
confirming the earlier detection by Boyle, Fong & Shanks 
(1988). Using models of lensing by simple haloes (Croom & 
Shanks 1999) they concluded that, if due to gravitational 
lensing, the observed anti-correlation favoured more mass 
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in groups of galaxies than accounted for in a universe with 
density parameter Q. m — 0.3. 

Though statistical lensing is most apparent for larger 
concentrations of mass, such as groups of galaxies, most re- 
cent attempts to measure and model associations between 
QSOs and foreground mass have focussed on the cross- 
correlation between QSOs and individual galaxies. Seldner 
& Peebles (1979) were among the first authors to record 
a statistically significant clustering of individual galaxies 
with bright, high-redshift QSOs, although Tyson (1986) ap- 
pears to be the first to have mentioned lensing as a possible 
explanation. Webster et al. (1988) developed a statistical 
lensing explanation for the association of QSOs with fore- 
ground galaxies, suggesting that more lensing mass must be 
being traced than expected (Kovner 1989; Narayan 1989; 
Schneider 1989). Since then, more authors have found posi- 
tive correlations between optically-selected QSOs and galax- 
ies (Thomas, Webster & Drinkwater 1995; Williams & Ir- 
win 1998). Many more have measured positive correlations 
between galaxies and radio-selected QSOs (Fugmann 1990; 
Bartsch, Schneider & Bartelmann 1997; Benitez & Martinez- 
Gonzalez 1997; Norman & Williams 2000), for which a larger 
lensing effect is expected. Very few anti-correlations between 
QSOs and galaxies have been detected (Benitez & Martinez- 
Gonzalez 1997; Ferreras et al. 1997), and these were mainly 
attributed to QSO-selection effects. However, as detected by 
Myers et al. (2003), an anti-correlation between faint QSOs 
and foreground matter is predicted by statistical lensing. 

The galaxy distribution, relative to underlying mass, 
is a function of bias, whereas the QSO distribution di- 
rectly traces the mass that lensed it. Thus, comparing the 
QSO-galaxy cross-correlation to the galaxy auto-correlation 
can be used to constrain galaxy bias. Recent observations 
of positive correlations between bright QSOs and galax- 
ies (Williams & Irwin 1998; Gaztahaga 2003) suggest a 
very low value for the bias parameter (b ~ 0.1). The work 
of Gaztanaga (2003) suggests 6 ~ 0.1 on sub-Megaparsec 
scales. The work of Williams & Irwin (1998) suggests b ~ 0.1 
on Megaparsec scales but is consistent with b ~ 0.1 on sub- 
Megaparsec scales. However, measurements of the strength 
of bias from clustering in galaxy surveys out to (redshift) 
z ~ 0.2 (Verde et al. 2002), comparisons of local galaxy 
clustering with the Cosmic Microwave Background (Lahav 
et al. 2002) and weak lensing shear (Hoekstra et al. 2002) 
at 2 ~ 0.35, seem to converge on a linear model of bias 
with b, of the order of unity on scales of 5 — 100 Mpc. Taken 
together, these results suggest that either there is a strong 
scale-dependence to bias, or that there exists an unexpected, 
strong systematic effect inducing positive correlations be- 
tween QSOs and foreground galaxies. 

The faint flux limit of the 2dF QSO Redshift Survey 
(Croom et al. 2004), henceforth referred to as the '2QZ', may 
test these two possibilities. As the 2QZ number-magnitude 
count slope at the survey flux limit is relatively flat, statisti- 
cal lensing predicts an anti-correlation between 2QZ QSOs 
and foreground galaxies, as opposed to the positive correla- 
tions predicted for brighter QSO samples and detected by 
Williams & Irwin (1998) and Gaztanaga (2003). As there is 
no other explanation for such opposite signals in the different 
samples, such a detection would provide a strong confirma- 
tion of the lensing hypothesis, and hence the constraints on 
b. There are additional reasons why the 2QZ is appropriate 
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Figure 1. The QSO integrated number counts, for the 2QZ, in 
0.2 mag bins, with Poisson errors. The line is a smoothed power 
law fit to the differential number counts. Brighter data points are 
from the 6QZ. Also displayed are the faint data from Boyle, Jones 
& Shanks (1991) and Koo & Kron (1988), which have been offset 
slightly to prevent the points from merging. 

to study statistical lensing. The 2QZ contains around half as 
many confirmed stars as QSOs, which may be used as a con- 
trol, to determine if any cross-correlation arises from target- 
selection. The 2QZ can limit the most likely systematic, 
intervening dust — by considering QSO colours, the extent 
that dust in galaxies could obscure background QSOs may 
be determined. Finally, the known redshifts of 2QZ QSOs 
can ensure that they are physically removed from a galaxy 
sample. 

This paper regards the cross-correlation between faint 
QSOs and foreground galaxies, and its implications for cos- 
mological parameters, particularly for galaxy bias. In Sec- 
tion 2 we outline the samples of QSOs and galaxies we shall 
cross-correlate. In Section 3 we outline our cross-correlation 
methodology, and investigate possible explanations for the 
resulting signal. Section 4 introduces models we use to inves- 
tigate the cross-correlation of QSOs and galaxies in terms 
of statistical lensing. Section 5 applies these lensing models 
and discusses implications for cosmological parameters, es- 
pecially how galaxies are biased relative to underlying mat- 
ter. Finally, in Section 6, we summarise the main results of 
this paper. 



2 QSO AND GALAXY SAMPLES 

The QSO and galaxy samples we will cross-correlate are es- 
sentially the same as described by Myers et al. (2003) , so we 
will only outline them briefly. QSOs are taken from the final 
2QZ catalogue (Croom et al. 2004) . The 2QZ comprises two 
5 deg x 75 deg declination strips, one in an equatorial region 
in the North Galactic Cap (centred at 5 = 0° with 09 h 50 m 
<, a <, 14 h 50 m ), and one at the South Galactic Pole (centred 
at 6 = -30°, with 21 h 40 m & a & 03 h 15 m ). We will refer to 
these regions as the 'NGC and 'SGC respectively. QSOs are 
selected by ultra-violet excess (UVX) in the u — bj : bj — r 
plane, in the magnitude range 18.25 < 6j < 20.85. The 2QZ 
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colour selection is <; 90 per cent complete for UVX QSOs 
over the redshift range 0.3 < z < 2.2. At higher redshifts the 
UVX technique fails as the Lyman-alpha forest enters the 
U band, and the completeness of the survey rapidly drops. 
Unless otherwise specified, we will consider only QSOs with 
a redshift z > 0.4, to prevent the overlap in real space of 
QSO and galaxy samples. We shall work with only the most 
definitively identified QSO sample, the so-called quality '11' 
sample. Quality '11' denotes a sample with the best level of 
reliability for both the QSO identification and redshift esti- 
mate (see Croom et al. 2004 for further explanation). These 
restrictions in redshift and spectroscopic quality leave 12042 
QSOs in the SGC and 9565 in the NGC. We shall also con- 
sider the supplemental 6dF QSO survey (henceforth 6QZ), 
which contains 376 QSOs after the application of our z > 0.4 
and '11' only spectroscopic identification criteria. For further 
details of the 2QZ and 6QZ, see Croom et al. (2004). 

The expected strength of lensing-induced correlations 
between galaxies and a magnitude-limited sample of QSOs 
depends on the slope of the integrated number-magnitude 
counts, q, fainter than the QSO sample's limit (Narayan 
1989). An enhancement of QSOs is expected behind fore- 
ground lenses when the slope of the QSO number-magnitude 
counts is greater than 0.4 and a deficit when the slope is less 
than 0.4. We thus need to estimate this slope for any lensing 
analysis. In Fig. 1 we reproduce the QSO N(< m) for the 
2QZ from Myers et al. (2003). The points are QSO num- 
ber counts in 0.2 mag bins with Poisson errors, and have 
been corrected for incompleteness and absorption by dust 
in our Galaxy. The line is a smoothed power law fit to the 
differential counts of form 



dN _ 

dm 10- Q dO-"H)) -)- \Q-f) A {m-m a ) 



(1) 



with a steep bright-end slope (/3d = 0.98), a knee at 
mo = 19.1, and a flatter faint end slope (e*d = 0.15). Myers 
et al. (2003) marginalised across all of the parameters in this 
fit and determined a faint end slope of a — 0.29 ±0.015, not- 
ing that incompleteness corrections might raise the la error 
to as much as 0.05. In this paper, we will assume a slope 
of a = 0.29 ± 0.03 at the survey flux limit. Brighter data 
points are from the 6QZ. Also displayed are the faint data 
from Boyle, Jones & Shanks (1991) and Koo & Kron (1988), 
which have been offset slightly to prevent points from merg- 
ing. Recent observations, attributed to statistical lensing, of 
positive correlations between QSOs and galaxies (Williams 
& Irwin 1998; Gaztanaga 2003) have used relatively bright 
QSO samples, which only probe the steep QSO number- 
magnitude counts below the knee. The large size and faint 
flux limit of the 2QZ allows us to probe significantly beyond 
the knee and test the lensing prediction of an anti-correlation 
between faint QSOs and galaxies. 

The southern galaxy sample we consider in this paper is 
taken from the APM Survey (Maddox et al. 1990b) , which 
is considered photometrically complete to a magnitude of 
6j < 20.5 (Maddox et al. 1990a). The northern galaxy sam- 
ple is taken from the Sloan Digital Sky Survey (henceforth 
SDSS) Early Data Release (henceforth EDR) of June 2001 
(Stoughton et al. 2002). The SDSS EDR sample is trans- 
formed into the 6j band from the SDSS g' and r' bands 
using the colour equations of Yasuda et al. (2001) and cut 



to 6j < 20.5 to match the APM limit. Both galaxy samples 
are restricted to areas in which they overlap the 2QZ strips. 
This leaves nearly 200,000 galaxies in the SGC 2QZ strip 
and 100,000 in the NGC 2QZ strip. Note that the SDSS 
EDR only partially fills the 2QZ NGC strip. 



3 QSO AND GALAXY CROSS-CORRELATION 
FUNCTIONS 

Correlation functions (Peebles 1980) are the main statistic 
of choice in studies of how QSO and galaxy distributions are 
related, although how the statistic is estimated varies con- 
siderably. In this section, we measure the two-point cross- 
correlation between SDSS or APM galaxies and 2QZ (or 
6QZ) QSOs. We study possible selection effects and different 
explanations for the signal. Notably, the expected variation 
of the QSO-galaxy cross-correlation with redshift and (espe- 
cially) magnitude of the QSO sample is a strong prediction 
of the statistical lensing hypothesis, and is something we 
may be able to test using 2QZ data. 



3.1 Correlation Estimator and Errors 

To measure the two-point correlation function, co(9), we use 
the estimator (Peebles 1980) 

DD 12 (0)n 

U{9) DRM6) ~ X ' (2) 

where DDu denotes the number of data-point pairs drawn 
from populations 1 and 2 respectively with separation 9. 
For DR12 the population 2 data are replaced with a cat- 
alogue of random points with the same angular selection 
function as the data. The factor n is the ratio of the size 
of the random catalogue to the data. Throughout this pa- 
per, we produce random catalogues with n — 10, minimizing 
statistical noise but allowing efficient speed of calculation. 
For further details on the 2QZ angular selection function see 
Outram et al. (2003) and Croom et al. (2004). Note that the 
cross-correlation function is equivalent in either 'direction' 
(i.e. under the exchange of labels 1 and 2 in Equation 2) 
provided that the angular selection of sources is accounted 
for by an appropriate random catalogue. In general, the an- 
gular selection functions of, say, a galaxy population and a 
QSO population are not the same. Further, the angular com- 
pleteness of a given survey is generally a function of redshift 
and magnitude, so considerable care should be taken in con- 
structing random catalogues for population subsamples. 

Numerous estimates of error on the cross-correlation 
have been proposed. We will consider three of these. One of 
the simpler forms, is the Poisson error based on the number 
of data-data pairs (across the entire survey) in the angular 
bin probed: 



[i + ^)] s 

DD{6) 



(3) 



where we will use to denote the standard error on the 
correlation function. Given that we wish to measure the 
significance of the correlation function compared to the 
null hypothesis represented by the random catalogue, we 
might instead use the Poisson error based on the number of 
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data-random pairs, which could be achieved by substituting 
DR/n for DD in Equation 3. 

The hypothesis that error on the correlation function is 
Poisson is not strictly fair. All else being equal, the num- 
ber of counts could be highly correlated as the same points 
appear in different pairs that are included in many differ- 
ent bins, especially on large scales. Some authors (Shanks 
& Boyle 1994; Croom & Shanks 1996) have suggested cor- 
rections to the Poisson form of error. Instead of such cor- 
rections, we will consider errors from field-to-field variations 
in the correlation function (see, e.g., Stevenson et al. 1985). 
Our data samples are split into 30 subsamples. This is arbi- 
trary in the case of the SDSS EDR but deliberately reflects 
plate boundaries in the case of the APM data (and by ex- 
tension the 2QZ, which is derived from APM photometry). 
The cross-correlation function is measured for each subsam- 
ple and the variance between the subsamples is determined. 
The standard error on co(0) is then the standard error be- 
tween the subsamples, inverse weighted by variance to ac- 
count for different numbers of objects in each subsample 



DR{9) 



(4) 



The weighting by number of objects is essential, mainly as 
plates in the southern APM immediately East of OOh RA 
cover less area than other plates at the same declination but 
also because of varying completeness in the QSO catalogue. 
We will refer to this error as field-to-field error. 

We will also consider a similar estimate of error to 
Equation 4, essentially a weighted form of the estimate pro- 
posed by Scranton et al. (2002) when calculating the auto- 
correlation of galaxies in the SDSS EDR 



N 

E 

L'=l 



DRl'(O) 
DR(0) 



[ Wi ,(0)-o,(0)] 2 



(5) 



Note that L (in Equation 4) refers to a subsample on one 
of our 30 plates, whereas L' refers to the subsample remain- 
ing on the other 29 plates. In other words, the procedure 
outlined in Equation 5 is to remove each of the plates in 
turn and to calculate the variance between each sample on 
the 29 remaining plates. We shall refer to this as jackknife 
error. The unweighted version of this estimate agrees well 
with simulations (see the appendix of Zehavi et al. 2002). 

Finally, we will measure the statistical correlation (not 
to be confused with the correlation function) between ad- 
jacent bins of w(9). The statistical correlation is related to 
the covariance, and is essentially an estimate of how inde- 
pendent the bins are - whether bin i has a tendency to take 
the same value as bin j. We might expect the covariance of 
the correlation function to be high, as the same data points 
can appear in different pairs that are counted in many dif- 
ferent bins. The statistical correlation takes the following 
form 



Corr(i,j) 



Cov(i,j) 



(6) 



where the covariance, Cov(i,j), is defined as 



Cov(i,j) = ^Y1 ~ °( fl 0)(wM(»i) - 0(0,-)) (7) 
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Figure 2. In the upper panel, we compare the Poisson, field- 
to-field and jackknife error estimates with a Monte Carlo error 
estimate determined from 100 Monte Carlo simulations of the 
NGC and SGC strips. In the lower panel we plot the mean of the 
100 simulations for the NGC and SGC individually and for the 
NGC and SGC combined. 



and 0i and 8j are two bins at different scales, and ui and a 
represent the mean and standard deviation over a number 
of realisations, M. The correlation is if the bins are in- 
dependent, approaches 1 if an increase in bin i leads to an 
increase in bin j and approaches —1 if an increase in bin i 
leads to a decrease in bin j. 

To test the accuracy of our correlation function esti- 
mator and the associated error, we have created 100 Monte 
Carlo simulations of the 2QZ. Each simulation contains the 
same number of (z > 0.4, identification quality of '11') QSOs 
as the 2QZ and has the same angular selection function, ne- 
glecting any intrinsic QSO clustering. We cross-correlate the 
simulated samples against APM galaxies (for the SGC strip) 
and the SDSS EDR (for the NGC strip). For each sample, 
the estimates of error outlined in this section are calculated 
and averaged. The mean value of the cross-correlation across 
the 100 samples is taken and the standard deviation (lo") is 
recorded as the Monte Carlo error. To avoid confusion be- 
tween our 'Monte Carlo error' and the statistical 'standard 
error' we shall refer to the Monte Carlo error as the 'Monte 
Carlo deviation'. 

In the lower panel of Fig. 2, we display the mean cross- 
correlation signal across the 100 Monte Carlo simulations. 
The agreement between the NGC and SGC results are excel- 
lent - better than 12 per cent of the Monte Carlo deviation 
on the NGC mean over all scales. The deviation of the com- 
bined result from zero, the expected result, is similarly no 
more than 12 per cent of the combined Monte Carlo devi- 
ation over all scales. We note that the shot noise (i.e. the 
standard error) would comprise 10 per cent of the Monte 
Carlo deviation (as we have 100 samples). Although the cor- 
relation function diverges on scales smaller than 0.3 arcmin, 
the error is sufficiently large on these scales that we may 
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Figure 3. In the upper panel we display the error on the corre- 
lation function taken in ratio to the Monte Carlo error estimate 
(1 standard deviation across the Monte Carlo simulations), for 
each estimate of error mentioned in the text. In all cases the er- 
rors are determined for the combined NGC and SGC sample. A 
dashed line is drawn for comparison at <t/<tmc = 1 (where the 
Monte Carlo estimate itself would lie). The lower panel depicts 
the covariance between adjacent bins determined from 100 Monte 
Carlo realisations. 



consider the correlation estimator probably valid on all of 
the scales plotted and certainly valid on scales larger than 
0.3 arcmin. Note that the consistency of the correlation esti- 
mator across all scales indicates that the software we use to 
calculate the estimator is robust. Also, note that the aver- 
age Monte Carlo sampling of u> contains 1180 random points 
in the smallest bin and has a jackknife error of 0.098 — this 
means that the typical fluctuation in our random catalog is 
three-and-a-half times smaller than our quoted error, and 
is dwarfed by the jackknife error, justifying our choice of 
n — 10 in Equation 2. 

In the upper panel of Fig. 2 we compare the various 
error estimates. The general trend of the errors is in good 
agreement, although the Poisson error estimate begins to 
under-predict the error (as compared to the Monte Carlo 
estimate) on larger scales. We assume that the Monte Carlo 
deviation represents a fair estimate of the true error on the 
correlation function and, In Fig. 3, plot the various errors 
taken in ratio to the Monte Carlo deviation. It is obvious 
that the Poisson error is an underestimate on <; 5 arcmin 
scales. We have also estimated the Poisson error by sub- 
stituting DR for DD in Equation 3. Such a change has 
negligible effect, as the simulated QSOs have a largely ran- 
dom distribution. The jackknife and field-to-field error esti- 
mates are much better than the Poisson estimate and con- 
stitute reasonable estimates on scales from 0.2 arcmin to 
nearly a degree. The field-to-field error, however, is perhaps 
a 20 per cent overestimate on larger scales, where the jack- 
knife error remains in line with the Monte Carlo estimate. 
The jackknife error estimate has an additional advantage 
over the field-to-field estimate. When the number of data 



points on any plate approaches zero, the field-to-field esti- 
mate is ill-defined but the jackknife estimate remains accu- 
rate until the number of data points across the entire survey 
approaches zero. 

In Fig. 3, we also plot the correlation due to the covari- 
ance between adjacent bins. This covariance is low - almost 
within the 10 per cent expected standard error. However, 
this covariance measure demonstrates that no correlation 
between adjacent bins is artificially introduced by our ran- 
dom catalogues - it does not guarantee that the data will 
show no covariance. We will quote the significance of results 
by estimating the correlation function and its associated er- 
ror in one 'large' bin (usually out to 10 arcmin), to minimise 
any covariance in the data on these scales. Throughout this 
paper, we adopt the estimate of the correlation function de- 
fined by Equation 2, together with the jackknife error es- 
timate of Equation 5 as both seem fair over scales we will 
probe. 



3.2 The Cross-Correlation of 2QZ QSOs and 
Galaxies 

In Fig. 4 we plot the cross-correlation of all 2QZ QSOs 
that meet our selection criteria (z > 0.4 and 2QZ identi- 
fication of '11') against SDSS EDR galaxies (in the 2QZ 
NGC strip) and APM galaxies (in the 2QZ SGC strip). The 
upper panel of Fig. 4 shows the cross-correlation individu- 
ally for the strips — galaxies are anti-correlated with QSOs 
in both. The anti-correlation is slightly stronger in the NGC 
strip but not significantly so. In the lower panel of Fig. 4, we 
plot the cross-correlation for both 'directions'. A significant 
anti-correlation is detected irrespective of whether we centre 
on galaxies and count QSOs or centre on QSOs and count 
galaxies, indicating that our random catalogues consistently 
account for the angular selection of QSOs or galaxies. 

If we bin the data displayed in Fig. 4 in a single bin 
of extent 10 arcmin and estimate the correlation function 
and (la) jackknife error, we find there is an anti-correlation 
of strength uj(< 10 arcmin) = -0.007 ± 0.0025 (10 arcmin 
is about 1 /i _1 Mpc at the median galaxy redshift of 0.15). 
The significance of this result is 2.8a for ui qg and 2.2a for 
ujgq. Although the anti-correlation is slightly less significant 
for ujgq, it is also slightly stronger, i.e. lo(< 10 arcmin) = 
—0.008 ± 0.0035. Further, particularly on large scales (> 
8arcmin), the error in u> gq is 60-80 per cent of the error in 
LOqg. It is unclear exactly why the errors are slightly larger 
when the analysis is carried out centring on galaxies but 
it is likely due to small discrepancies, perhaps second-order 
gradients, between the random catalogues used to mimic the 
different angular selection functions of the QSO and galaxy 
samples. 

The anti-correlation displayed in Fig. 4 is very strong 
on small scales and agreement between the two 'directions' 
of the correlation function is excellent. For instance, both 
ujg q and uiqg show an anti-correlation of strength -0.02 out 
to 3 arcmin at 3a significance. Note that the innermost 
bin barely contributes to this particular signal, as it con- 
tains less than 100th of the pairs in the bin at 2 arcmin. 
When modelling, we will consider u> qg , the slightly weaker, 
slightly more significant result. This is mainly because ul- 
timately, when discussing the anti-correlation in terms of 
lensing, we will compare the galaxy-galaxy auto-correlation, 
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Figure 4. The cross-correlation of 2QZ QSOs against SDSS EDR 
galaxies (in the NGC 2QZ strip) and APM galaxies (in the SGC 
2QZ strip). The upper panel displays the cross-correlation signal 
for the 2 strips individually. The lower panel displays the signal 
combined for both strips. The lower panel shows estimates for 
both 'directions', centring on QSOs and counting galaxies (u) qg ) 
and centring on galaxies and counting QSOs (u> gq ). Error bars 
represent la jackknife errors. Labels note the number of objects 
of each population present within the confines of the 2QZ bound- 
aries. Points within the same bin have been offset slightly for ease 
of display. 

ujgg, to ui qg , which shares the same random catalogue. It is 
useful, though, to have shown a significant, consistent anti- 
correlation between QSOs and galaxies irrespective of the 
'direction' of the cross-correlation. 

3.3 Is the Anti-Correlation Between Galaxies and 
QSOs a Selection Effect? 

Certainly, there is a significant anti-correlation between 
galaxies and 2QZ QSOs. It is natural to ask if the sig- 
nal arises when constructing the QSO or galaxy catalogues. 
There are several techniques that might produce an anti- 
correlation between QSOs and galaxies. The initial construc- 
tion of the 2QZ UVX target catalogue removed extended 
images. Although all 'high' redshift (z > 0.5) QSOs should 
appear stellar, they may merge with foreground objects to 
look extended on images. A bright (bj < 19.5) subsample of 
these extended images should end up in the 2dF Galaxy Red- 
shift Survey (henceforth 2dFGRS) and thus appear in deficit 
in the 2QZ. It turns out that this affects separations between 
galaxies and QSOs on scales of about 8 arcsec (Madgwick 
et al. 2002) , smaller than the scales we are probing. Restric- 
tions on the placement of 2dF fibres means the minimum an- 
gle between objects in a 2QZ field is about 30 arcsec, which 
might mean a paucity of objects at small separations. How- 
ever, this restriction should not include QSO-galaxy separa- 
tions as, although the 2QZ was carried out simultaneously 
with the 2dFGRS, QSO observations were given a higher 
observational priority, so we would expect few QSOs to be 



rejected because of their proximity to galaxies. Addition- 
ally, the vast majority of fields in the 2dF survey were re- 
peatedly observed to circumvent fibre-allocation problems. 
Myers et al. (2003) suggest that a 30 arcsec restriction on 
the minimum angle between 2dF objects in any field leaves 
no significant signature on these scales. 

The easiest way to judge measurement systematics in 
the 2QZ is to consider a control sample of objects that un- 
derwent identical data reduction as the QSOs but should dis- 
play no cosmological signatures. There are 10587 stars (with 
'11' identification quality) in the 2QZ. In Fig. 5 we plot the 
cross-correlation of these stars against our galaxy samples. 
The upper panel of Fig. 5 compares the cross-correlation 
estimate for the NGC and SGC 2QZ strips. The agree- 
ment is reasonable, although the NGC sample is slightly 
more positively correlated with galaxies. Note that we might 
not expect the stellar correlation functions to be zero on 
all scales — gradients exist in local structure, which are not 
recreated in the random catalogues and which will make the 
correlation signal higher or lower on average. However, we 
would expect the stellar signature to be flat across the scales 
of interest. This is highlighted in the lower panel of Fig. 5, 
where we display the star-galaxy and galaxy-star cross cor- 
relations. Unlike in the QSO-galaxy case, there is a discrep- 
ancy in the large-scale zero-point of the correlation function 
that depends upon the 'direction' of the cross-correlation. As 
we might expect, when the random catalogue is constructed 
to match the stellar distribution, the zero-point of the cor- 
relation function drops significantly (to about -0.03). The 
large-scale value of the correlation function is zero when 
the random catalogue is constructed to match the galaxy 
distribution, which is free from (at least genuine physical) 
gradients. The key point, is that the correlation functions for 
stars and galaxies are flat (deviating at most 0.5<r from their 
respective zero-points on scales larger than 6 ~ 0.4 arcmin), 
indicating that systematics in the construction of the galaxy 
and QSO samples are low, hence induce no false correlations 
in our QSO-galaxy cross-correlations. The innermost points 
(6 < 0.4 arcmin) plotted in Fig. 5 seem to genuinely deviate 
from the zero-point and may be representative of merged 
images, fibre placement signatures or some other system- 
atic. We will continue to plot these points in figures in this 
section but will not consider them in any modelling analysis 
or quotes of the significance of a signal. 

3.4 Cosmological Explanations For the 

Anti-Correlation Between QSOs and Galaxies 

An obvious physical effect, other than lensing, that could 
cause an anti-correlation between QSOs and galaxies is dust 
in galaxies obscuring background QSOs. This would lead to 
a dearth of QSOs around galaxies. Alternatively, if the anti- 
correlation is due to statistical lensing, we might be able to 
see the variation of the cross-correlation between galaxies 
and QSOs with the magnitude of the QSO sample. 

3.4-. 1 Is Intervening Dust the Cause of the 

Anti- Correlation Between QSOs and Galaxies? 

We can use colour information to determine if interven- 
ing dust preferentially distributed around galaxies could 
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Figure 5. The cross-correlation of 2QZ stars against SDSS EDR 
galaxies (in the NGC 2QZ strip) and against APM galaxies (in the 
SGC 2QZ strip). The upper panel displays the cross-correlation 
signal for the 2 strips individually. The lower panel displays the 
signal combined for both strips. The lower panel shows estimates 
for both 'directions', centring on stars and counting galaxies (to sg ) 
and centring on galaxies and counting stars (u}g S )- Error bars rep- 
resent lcr jackknife errors. Labels note the number of objects of 
each population present within the confines of the 2QZ bound- 
aries. Points within the same bin have been offset slightly for ease 
of display. 



Figure 6. Limits on the amount of 'typical' dust that could ac- 
count for the QSO-galaxy anti-correlation measured in Fig. 4. The 
upper panels show the average colour of QSOs in bins centred on 
SDSS EDR galaxies (in the NGC 2QZ strip) and APM galaxies 
(in the SGC 2QZ strip). Error bars in these panels represent the 
standard deviation (lcr) in 1000 bootstrapped simulations with 
the same QSO positions as the 2QZ but scrambled colours. The 
lower panels translate these lcr error bars into limits on absorp- 
tion in the bj band (Ai, ). The absorption limits are translated 
into limits on the QSO-galaxy cross-correlation using a simple 
model outlined in the text and displayed against the points and 
errors on ui qg from Fig. 4, represented by triangles. 



remove QSOs from the 2QZ catalogue out to 10 arcmin 
(~ 1 /i _1 Mpc). Our method is similar to the correlation 
estimator of Equation 2 but instead of counting the average 
number of QSOs in differential annuli around galaxies, we 
work out the average colour of QSOs. Errors are calculated 
using 1000 bootstrapped simulations with the same QSO 
positions as the 2QZ catalogue but scrambled colours. 

Knowing the expected QSO colour on degree scales, we 
can calculate the allowed colour excess, E(B — V), in a given 
bin for both sets of measured 2QZ colours, u — bj and bj — r. 
Schlegel, Finkbeiner & Davis (1998) provide tables to con- 
vert from the colour excess to the amount of absorption 
by dust. Thus we can constrain the amount of dust around 
galaxies along QSO lines of sight. 

Our lcr limits on absorption can be converted into a 
limit on the correlation function using a simple model out- 
lined in Boyle, Fong & Shanks (1988). Dust around galaxies 
will cause an absorption (which we calculate from our mea- 
sured colour excesses) that will alter the magnitude limit of 
the QSO number counts close to galaxies 

w + 1 _ N(< m)g a i _ N(< m - A b] ) F ield , g , 
~ N(< m) Field ~ N(< m)Field 

where N(< m) is the integrated number counts, A bj is 
the absorption in the fej band and the subscripts 'Gal' and 
'Field' represent values near to galaxies and in the field, re- 
spectively. Equation 8 can easily be simplified if the number 
counts are represented by a power law but to be exact, we 



will simply use the full fitted form of the number counts of 
2QZ QSOs (see Fig. 1). 

We now have a simple model that converts the error on 
our measurement of the average colour of QSOs near galax- 
ies into a limit on the observed anti-correlation due to dust. 
In the upper two panels of Fig. 6, we show the average colour 
of QSOs around our galaxy samples with bootstrapped error 
bars, for 2QZ colours. A solid line marks the expected value 
from our scrambled bootstrap simulations. Except, perhaps, 
for the innermost bin, there is no significant deviation in the 
colour of QSOs from the expected value. The reddening dis- 
played in the innermost bin of the u — bj colours corresponds 
to an increase to the blue in the bj —r colours, suggesting it 
is a small-scale measurement artefact or a statistical fluctua- 
tion, rather than due to dust. In the lower panel of Fig. 6, we 
translate the bootstrapped error bars into limits on absorp- 
tion by dust around galaxies. The solid u — 6,t and dashed 
6j — r lines in the lower panel represent the lcr limit on the 
anti-correlation due to dust allowed by our colour limits. 
The anti-correlation between QSOs and galaxies measured 
in Fig. 4 is plotted for comparison. The lcr limits are insuf- 
ficient to account for the anti-correlation between galaxies 
and QSOs, in fact, the bj — r limits are marginally rejected, 
at a 2cr level, and (at this level of rejection) could only ac- 
count for about 30 per cent of the anti-correlation between 
QSOs and galaxies (out to 10 arcmin). 

Schlegel, Finkbeiner & Davis (1998) base their absorp- 
tion laws, from which our absorption estimates are de- 
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rived, on the difference in reddening between local, lightly- 
reddened standard stars and more distant reddened stars in 
the Milky Way (Cardelli, Clayton & Mathis 1989; O'Donnell 
1994). The reddened stars are chosen to sample a wide range 
of interstellar environments (Fitzpatrick & Massa 1990) and 
Schlegel et al. demonstrate that their dust laws reproduce 
the reddening (as compared to the Mgll index) of a sample of 
~ 500 elliptical galaxies that have broad sky coverage. How- 
ever, by design, the absorption law used by Schlegel et al. 
only applies to our Galaxy. There is no reason to believe it is 
universal. In fact, though the Magellanic Clouds have been 
shown to have similar absorption laws to the Milky Way 
(Koornneef 1982; Bouchet et al. 1985), other local (z < 0.03) 
galaxies have been shown to have 'greyer' absorption laws 
(Calzetti, Kinney, & Storchi-Bergmann 1994; Kinney et al. 
1994), meaning that more dust absorption (approximately 
25 per cent in the bj band) is expected for the same red- 
dening. At the 2a level, none of these alternative dust laws 
could provide enough absorption to explain the observed 
QSO-galaxy anti-correlation. 

It is possible to construct dust models that would rec- 
oncile the lack of reddening of the QSO sample near galaxies 
with a paucity of QSOs around galaxies. For instance, the 
UVX QSO selection of the 2QZ means that QSOs that were 
excessively reddened may be lost entirely from the 2QZ, es- 
pecially if the measured colours of 2QZ QSOs exhibit a large 
scatter around their intrinsic colours (which they don't, at 
least at z < 2 - see Croom et al. 2004). Also, if there was 
a lot of dust close to some galaxies, which completely ob- 
scured QSOs without reddening them, then the clustering 
of galaxies alone might lead to the observed galaxy-QSO 
anti-correlation. However, though various dust models can 
be constructed to explain a galaxy-QSO anti-correlation, it 
is very difficult to reconcile any dust model with the positive 
QSO-galaxy correlations seen in other samples (Williams & 
Irwin 1998; Gaztanaga 2003). 

Acknowledging the provisos outlined above, we ten- 
tatively proceed assuming that dust around galaxies can- 
not account for the anti-correlation between QSOs and 
galaxies — but can we find definitive evidence that gravita- 
tional lensing is responsible? 

3.4-2 Dependence of the Cross-Correlation Signal on 
Magnitude and Redshift 

Perhaps the key prediction of magnification bias is that an 
enhancement of QSOs is expected near foreground lenses 
when the slope of the QSO number-magnitude counts is 
greater than 0.4 and a deficit of QSOs around the same 
lenses when the slope is less than 0.4. The QSO number 
counts are shown in Fig. 1. The 'knee' of the magnitude dis- 
tribution, where the slope is 0.4, lies around bj = 19.1 to 
b.j = 19.6. A very simple model would predict (under the 
assumption that lensing samples QSOs up to about a mag- 
nitude fainter than the sample limit) a positive correlation 
between QSOs and galaxies up to a (QSO) bj magnitude of 
around 18.1-18.6, no correlation between QSOs and galaxies 
in the range 18.1-18.6 to the knee of the magnitude counts, 
and an anti-correlation between QSOs and galaxies fainter 
than about bj = 19.6. 

At the time of writing, no author has yet shown the 
dependence of the cross-correlation signal with magnitude 



from a positive correlation at bright QSO magnitudes to an 
anti-correlation at faint magnitudes, although some authors 
have shown the transition from a positive correlation to zero 
correlation (Williams & Irwin 1998; Gaztanaga 2003). This 
is mainly a problem of QSO sampling - as yet no single 
survey spans the QSO magnitude distribution in a manner 
that produces significant numbers of QSOs at both bright 
and faint magnitudes. The combined area and depth of the 
2QZ and 6QZ allows us to trace the dependence of the 
cross-correlation between QSOs and galaxies with magni- 
tude across the knee of the QSO number-magnitude counts 
for the first time. In the upper panel of Fig. 7 we show 
the dependence of the QSO-galaxy cross-correlation signal 
(measured out to 10 arcmin) in (differential) magnitude bins 
spanning the range 16.25 < 6,j < 20.85. QSOs are taken from 
the 2QZ for 6.j > 18.25 and from the 6QZ for bj < 18.25. 
There is a loose trend in the data suggesting that the bright- 
est QSOs are positively correlated with our galaxy samples 
(at the 2.2a level for bj < 16.65) and the faintest QSOs are 
anti-correlated with our galaxy samples (at the 2.3<r level for 
fej > 19.85) and there is no significant result for the remain- 
der of the magnitude range. To test whether this trend al- 
lows us to favour lensing over dust in foreground galaxies as 
a cause of the QSO-galaxy cross- correlation signal, we have 
constructed two toy models. The best-fitting models are dis- 
played in Fig 7. The 'dust' model is essentially as shown in 
Equation 8 with At , , the absorption (on 10 arcmin scales to 
match the data in Fig. 7) as the free parameter. The relative 
numbers in a given magnitude bin are calculated from the 
full integrated counts displayed in Fig. 1. The 'lens' model 
assumes that LO qg = K (2.5a(m) — 1) where K is considered 
a constant (see Equation 15 below) out to the 10 arcmin 
scale of interest. There are actually two free parameters in 
this model, K and m where m is the number of magni- 
tudes fainter than the bin of interest to sample the slope, a, 
of the integrated number counts. The best dust model has 
A bj = 0.005 with a reduced x 2 value of 1.20 (P(x 2 ) = 0.29) . 
The best lens model has K = 0.015, m = 2.1 and a reduced 
X 2 of 1.04 (P(x 2 ) = 0.41). Neither of these simple models 
can be ruled out by the data. We cannot say with any con- 
fidence, then, that we have detected the full dependence of 
the cross-correlation signal with magnitude predicted by sta- 
tistical lensing. This is not necessarily surprising, as there 
are very few objects in the 6QZ and only 256 QSOs brighter 
than 6,j = 18.1 that meet our usual redshift and identifi- 
cation quality selection criteria. Nevertheless, we do see a 
trend away from anti-correlations for brighter samples, in 
line with a lensing hypothesis. 

In the lower panel of Fig. 7, we show the variation of the 
cross-correlation of 2QZ QSOs against our combined galaxy 
samples with redshift in differential bins of z = 0.4. A sim- 
ple lensing model predicts no detectable trend in the cross- 
correlation signal with QSO redshift. We might expect a 
stronger signal at larger redshifts, as we will, on average, 
sample fainter QSOs in these bins and thus observe the sig- 
nature of changes in magnitude in the redshift distribution. 
Indeed, we see a reasonably consistent anti-correlation for 
all redshift bins. The signal is slightly stronger at high red- 
shifts but not significantly so. The lowest redshift QSOs are 
significantly correlated with our galaxy samples, no doubt 
due to genuine clustering at these redshifts, justifying our 
cut of 2 < 0.4 in other analyses throughout this paper. 
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Figure 7. The cross-correlation of 2QZ QSOs against SDSS EDR 
galaxies (in the NGC 2QZ strip) and against APM galaxies (in 
the SGC 2QZ strip) and their dependence on fej magnitude and 
rcdshift. The upper and lower panels display the cross-correlation 
signal measured out to 10 arcmin for subsamples of 2QZ and 
6QZ QSOs in magnitude and redshift bins respectively. Error bars 
represent Ict jackknife errors. The 'lensing' and 'dust' models are 
described in the text. 



In this section, we have shown a significant anti- 
correlation between faint QSOs and galaxies, and that is 
not a systematic effect of the QSO selection. We rule out 
the possibility that the majority of the signal is due to dust 
distributed around galaxies at the 2a level by comparing 
QSO colours in the field and close to galaxies, noting the 
caveat that models can be constructed where QSOs are ob- 
scured by dust without being reddened. We have shown that 
the anti-correlation is, however, consistent with lensing pre- 
dictions. We will now model the anti-correlation assuming 
it is due to lensing and consider cosmological implications. 



4 STATISTICAL LENSING MODELS 

In this section, we outline two lensing models we shall use in 
describing the anti-correlation between QSOs and galaxies. 
Both models compare the cross-correlation between QSOs 
and galaxies to the auto-correlation of galaxies to determine 
galaxy bias with respect to the mass traced by QSO light. 
The first, due to Williams & Irwin (1998; see also Williams 
& Frey 2003), uses a linear biasing prescription to relate 
fluctuations in the foreground mass distribution to QSO 
magnification. Williams & Irwin showed that this simple 
model agrees well with more complicated simulations (Dolag 
& Bartelmann 1997; Sanz, Martinez- Gonzalez & Benitez 
1997). The second model, due to Gaztanaga (2003), allows 
for scale-dependent bias, and more fully considers the red- 
shift distributions of the source and lens populations. 



4.1 Williams &: Irwin Model 

The convergence of lensing matter, k, is defined as 



<0) 



T. cr {D u D s ) 



(9) 



where E(Z)j,#) is the surface mass density of the lensing 
material. The critical mass surface density is a function of 
the redshift of background source QSOs (z s ) and of the fore- 
ground lensing matter (z), and is defined 
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for (angular diameter) lens distance Di, source distance D s 
and lens-source separation Di 3 . We take z s — 1.5, the me- 
dian redshift of the 2QZ, for the background source redshift. 
Although this seems a rather extreme approximation of the 
actual distribution of QSOs, Bartelmann & Schneider (2001) 
suggest it is fair. 

Williams & Irwin model the lensing material as a 
smooth slab that extends over the redshift range z = to 
z — z max = 0.3, where the extent of the redshift distribution 
is estimated using the magnitude-redshift selection function 
of Baugh & Efstathiou (1993), which provides a good fit 
(Maddox et al. 1996) to the Stromlo-APM Survey redshift 
distribution (Loveday et al. 1992a; Loveday et al. 1992b). We 
will also use the selection function of Baugh & Efstathiou to 
model the galaxy redshift distribution. Note that just over 
95 per cent of galaxies are included out to z max = 0.3. 

Lensing arises due to angular fluctuations in the pro- 
jected matter density around the mean. The mass fluctu- 
ations can be characterised by the density contrast 8(8), 
which measures the mass density at a given scale relative 
to the mean across the sky. As the density across the en- 
tire sky is the mean, the distribution of density contrast 
is normalised (J P(S\9)8d8 = l). The effective lensing con- 
vergence due to mass fluctuations are enhanced above the 
mean, yielding K c ff(#) = R(5 — 1). In this model, bias is 
scale-independent, meaning that galaxy fluctuations trace 
mass fluctuations as <5g(#) — 1 = b[S(6) — 1]. The surface 
density of a slab of matter of thickness cdt at redshift z, is 
£ = pcritfio(l + z)' i cdt, where p crit is the critical density of 
the Universe. Thus 



(1 + *) 



fdz 

dz (11) 



The enhancement of QSO number density over the 
mean around any galaxy is given, for QSO number-counts 
with slope a, by 



1 + 2k 



(2.5a - 1) 



(12) 



where (i«l| 2/t e fi is the lensing magnification in the weak 
regime (and we've performed a Taylor Expansion). Now, 
the QSO-galaxy cross-correlation can be estimated by the 
enhancement in galaxy counts across the probability distri- 
bution of galaxy density contrasts 



P(8G\0)5 G 8QdSG 



(13) 



© 0000 RAS, MNRAS 000, 000-000 



10 A. D. Myers et al. 



substituting Equation 12 into the previous equation and re- 
membering J P(5\9)5dS — 1, one can rearrange to show 



oj qg {9) = (2.5a - 1) 
so, finally 



2k 



I 



P(5 G \e)S G SGd8G 



- 1 



2k 



(14) 



(15) 



uj qg {9) = (2.5a - 1)— u; gg (9) 

relates ui gg (9), to co gg (9) and b, the (scale-independent) bias 
parameter. Other than b, cosmology is contained in the (di- 
mensionless) mean convergence R (Equation 11). The QSO 
number counts are included in the power-law slope, a. 



4.2 Gaztanaga Model 

This model is reviewed in the Appendix, and described in 
detail by Gaztanaga (2003; see also Myers 2003). Gaztanaga 
(2003) has shown that a good power-law expression for scale- 
dependent biasing of galaxies is 



b(R) = b , 



O.lft^Mpc 



7? 



(16) 



Whilst Gaztanaga assumed that this bias corresponded to 
the ratio of the galaxy and matter correlation functions (see 
also Guimaraes 2001); b(r) — [£ 99 (r)/£ mm (r)] , we are in 
fact comparing the cross- correlation of QSOs and galaxies 
with the galaxy auto-correlation function, and so are defin- 
ing a bias function via b(r) = £, gg {r) / £, gm {r) ■ Whilst on large 
scales these two definitions should be equal, one expects the 
halo of the central galaxy to affect the shape of £, gm (r) on 
small scales. Therefore we consider a revised version of the 
Gaztanaga model to take this into account. 

Through observation of uj gg and tu gq we can constrain 
the amplitude and slope of the bias correlation function. The 
two-dimensional projection of the galaxy correlation func- 
tion is (consider Peebles 1980) 

u gg {0) = trL&o.i&V-^A^o.i (17) 
whilst the galaxy-QSO cross-correlation is given by 

Lo gq (e) = al 1 b*e 1 -^A gq , . 1 (is) 

where ao.i, the amplitude of fluctuations in the matter corre- 
lation function (averaged over asphere of radius 0.1fe _1 Mpc) 
is defined in Equation A3, A gg< o.i and A gqt o.i depend on the 
radial selection functions of the galaxy and QSO samples 
and the lensing efficiency, and b and 7 denote the slope and 
amplitude of a scale-dependent bias model. In the particular 
case where the two bias definitions ([£ 99 /£ 9 m]; [£ 99 /£mm] ' 5 ) 
are equivalent, b* is equivalent to 60.1 (see the Appendix for 
further details on these parameters, particularly b*). 

The slope of the bias correlation function, 75, can be 
easily determined from 7 99 and 7 9q . The bias parameter, 
60.1, can then be determined 



u gg (0) ^gq.Q-i Qi h 

U) gq (9) Aggfl.l 



(19) 



In the case where the two bias definitions, above, agree 
exactly (as implicitly assumed by Gaztanaga), b* = 60.1 and 
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Figure 8. The galaxy-galaxy auto-correlation combined from 
SDSS EDR galaxies (in the NGC strip) and APM galaxies (in 
the SGC strip) is compared to the faint-end QSO-galaxy cross- 
correlation function. Both correlation functions are fitted by 
power laws, displayed by lines drawn through the respective data. 
Error bars represent Icr jackknife errors. 



7* =76- In this case, one can further derive the slope of the 
mass correlation function, 7, and cto.i can be determined via 
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5 RESULTS AND DISCUSSION 

In Fig. 8 we plot the auto-correlation of galaxies combined 
for galaxies from the SDSS EDR in the NGC 2QZ strip and 
the APM Survey in the SGC 2QZ strip. Fig. 8 also shows 
the anti-correlation between QSOs and galaxies discussed 
throughout this paper. We have plotted the result for the 
QSO sample fainter than bj = 19.6, to ensure we are sam- 
pling QSOs from a region of the number-magnitude counts 
fainter than the 'knee' of the distribution (see Section 3.4.2). 
We fit simple power-laws to the two correlation functions, 
finding that 



ra\ n no/i i 0.008 fl -1.0±0.3 

UqgyO) = -0.024 ±0.007 6 1 

ta\ n oon i 0.015 /i-0.76±0.04 

^99(0) = 0.330 ±0.014 



(21) 
(22) 



where 9 is expressed in arcminutes, 1 arcminute being about 
0.1ft _1 Mpc at the median redshift of our galaxy data. In 
both cases, a simple \ 2 fit suggests good power-law ap- 
proximations to the data, with a reduced \ 2 value of 1.6 
(P(x 2 ) = 0.11) for the galaxy-galaxy correlation fit and 1.4 
(P(X 2 ) = 0.20) for the QSO-galaxy anti-correlation fit. The 
result for the slope of the galaxy-galaxy auto-correlation is 
in excellent agreement with recent small-scale measurements 
for galaxies with similar limiting flux to the sample used here 
(Connolly et al. 2002). 
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Model 




n m = 0.3,^a = 0.7 


f^m — 1 


W&I, 98 


bo.i 


0.13±°;°? 


U " 3Z=C 0.18 


Gaz, 03 


bo.i 

bo.2 


o.o52±8:8«* 

0.06l±8;811 


n i 42 +0.167 
u ' 1 ^ z=c 0.072 
n icc+0.144 

U.lOOZtg Qgg 



Table 1. In the row labelled 'W&I, 98' we list b, evaluated at 
9 = 1 arcmin (~ 0.1 /i _1 Mpc), as measured from Equation 23, 
with £ cr (z, z s ) calculated using Equation 10 with z s = 1.5 for the 
median QSO rcdshift. In the row labelled 'Gaz, 03' we list b evalu- 
ated at 9 = 1 arcmin, and 9 = 2 arcmin (~ 0.2 h~ Mpc), as mea- 
sured from Equation 19. Both estimates arc shown for ACDM and 
EdS cosmological models, assuming Hq = 70kms _1 Mpc -1 . 



5.1 Williams & Irwin Model 

Taking our measured values for the correlation functions, 
evaluated at 9 = 1 arcmin (~ 0.1 /i _1 Mpc), and a faint end 
slope of a = 0.29 ± 0.03, Equation 15 (Williams & Irwin 
1998) reduces to 



b . 



7.56 il;!! k, 



(23) 



a function of the convergence, R. We calculate this as- 
suming either an Einstein-de- Sitter (henceforth EdS), or 
ACDM cosmology, and the resulting values of b are displayed 
in Table 1. The model is somewhat dependent on the param- 
eter 2 max of Equation 11. For instance, increasing 2 m ax from 
0.3 to 0.4 increases the estimates of &o.i by 50 per cent, with 
the errors scaling accordingly. In our samples, 99.5 per cent 
of galaxies lie at z < 0.4. 

Note that, as the measured slopes of uj gq {9) and u) gg {9) 
are not the same, a scale-independent model of bias is only 
marginally accepted. It is relatively easy to extend Equa- 
tion 15 to a simple model of scale-dependent bias, obtaining 



b = bo.i 



0.1/i _1 Mpc 



-0.24+0.30 



(24) 



where bo.i is shown in Table 1 and r is equivalent to 9 ex- 
pressed in /i _1 Mpc at the mean redshift of our galaxy data. 
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Figure 9. Likelihood contours in the bo.i - 76 plane for a fit 
to the QSO-galaxy cross-correlation function, u) gq (0), assuming 
ACDM. Contours are plotted for \ 2 values corresponding to a 
one-parameter confidence of 68 per cent (dotted contour), and 
two-parameter confidence of 68 and 95 per cent (dashed con- 
tours). The best fit model obtained (marked with a +) has 
bo.i = 0.052 and jf, = —0.23. The stars mark the models plotted 
with dashed lines in Figure 10. 



7b plane. There is a clear degeneracy, with stronger scale- 
dependence of bias (i.e. more negative 75) implying a lower 
value of bo.i. The uncertainty in our determination of b is 
somewhat larger than the quoted error in Gaztanaga (2003), 
and, as demonstrated by Fig. 9, is considerably skewed in b- 
space. From Fig. 9 it is possible to reconcile the value of bo.i 
obtained with the method of Gaztanaga with the somewhat 
higher value of 60.1 obtained using the Williams & Irwin 
model: the latter assumes scale-invariant bias, or jt = 0, 
which corresponds to a higher value of 60.1 ~ 0.12 from the 
bo.i ~ 76 degeneracy in the Gaztanaga model fit. 



5.2 Gaztanaga Model 

Using the measured slopes 7 99 and 7 9q , we find a slope 
for the bias correlation function of 76 = —0.24 ± 0.30. We 
now use Equation 19 to determine the galaxy bias in both 
ACDMand EdS cosmologies, with H = 70kms _1 Mpc -1 , 
for 0.1 /i _1 Mpc scales. The results are displayed in Table 1. 
The analysis is repeated measuring b on 0.2/i _1 Mpc scales 
to allow a direct comparison with the results of Gaztanaga 
(2003). The results only have a slight dependence on Ho - In- 
creasing Ho to 100 km s -1 Mpc -1 increases the estimate of b 
by 9 per cent (for both EdS and ACDM cosmologies, and for 
both bo.i and bo.2)- Decreasing Ho to 50kms _1 Mpc _1 de- 
creases b by either 8 per cent (ACDM ) or 7 per cent (EdS). 

Using the fit to co gg (9) from Equation 22, together with 
Equation 19, we derive a model for u> gq (9) as a function of 
bo.i and 76. We fit this model to the cross-correlation data 
by performing a maximum likelihood analysis, determining 
the likelihood of each model by calculating the \ 2 value 
of each fit. Fig. 9 shows likelihood contours in the bo.i - 



5.3 Discussion 

It is interesting that our bias predictions, based on a de- 
tection of a galaxy-QSO anti-correlation, agree well with 
the predictions of an independent author (Gaztanaga 2003) 
who reported a positive galaxy-QSO correlation, working 
with bright QSOs and (mostly) independent data. Both of 
the methods we have used in this paper, which use quite 
different lensing models to determine the amplitude of the 
galaxy bias parameter, b, consistently agree that on scales 
of 0.1 /i _1 Mpc, galaxies are strongly anti-biased (i.e. b < 1) 
with a bias parameter of b ~ 0.1. Using a simple model 
(Williams & Irwin 1998) and assuming that lensing mat- 
ter can be represented by a uniform slab extending out 
to the redshift where 95 per cent of the surveyed galax- 
ies are included, and the lensed QSOs may be represented 
by a single population at z = 1.5, we find bo.i — 0. 13± ;or 
for a ACDM cosmology. A more realistic model that traces 
spherical fluctuations in the underlying lensing matter on 
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Figure 10. A comparison of some recent measurements of the 
bias parameter with those made in this paper. The filled circles 
are measurements of the bias parameter deduced from the ratio 
of the correlation functions in Fig. 8 using Equation 19 (i.e. the 
'Gaztanaga Model'). The angular bins of the correlation func- 
tions have been projected to the median redshift of the galaxy 
distribution (z ~ 0.12). The solid line represents the best fit- 
ting bias model, with 60.1 = 0.052 and 75 = —0.23. The various 
dashed lines depict the error ranges taken from the 68% likeli- 
hood contour; the models chosen are shown by stars in Figure 9. 
Each measurement displayed from this paper is calculated using 
a ACDM cosmology. Other points represent recent measurements 
of the bias parameter from the literature. 



0.1/i _1 Mpc scales, across the redshift distributions of our 
QSO and galaxy samples (Gaztanaga 2003), yields 60.1 = 
0.04±o;of. Note that, whilst traditionally galaxy bias is mea- 
sured from the ratio between the galaxy auto-correlation 
and the mass auto-correlation, as we are comparing the 
cross-correlation of QSOs and galaxies with the galaxy auto- 
correlation function, we are instead defining a bias function 
via b(r) = Zgg(r)/Z gm (r). 

A value of b ~ 0.1 might seem at odds with observations 
of much higher values of b ~ 1 on Megaparsec scales (Verde 
et al. 2002; Lahav et al. 2002; Hoekstra et al. 2002), assum- 
ing a simple linearly-biased ACDM mass distribution, sug- 
gesting more mass around galaxies than expected. However, 
the slope of the scale-dependent bias parameter is found to 
be 71, = —0.24 ± 0.3. Although we cannot rule out a linear 
bias parameter on scales of 0.1/i -1 Mpc from the slopes of 
our fitted correlation functions, the fact that many other 
authors determine much higher values of 6 ~ 1 on Mega- 
parsec scales (Verde et al. 2002; Lahav et al. 2002; Hoekstra 
et al. 2002) suggests that bias might be a strong function 
of scale on 100fa -1 kpc scales. In Fig. 10 we plot the bias 
implied across a range of scales from measurements made 
in this paper and compare them to measurements made by 
other authors (Williams & Irwin 1998; Verde et al. 2002; 
Lahav et al. 2002; Hoekstra et al. 2002; Gaztanaga 2003). 
The models discussed in this paper allow a scale-dependent 
model (with b ~ r 0,5 ) that is consistent both with our ob- 



servations on 0.1 h x Mpc scales, as well as other measures 
of the galaxy bias parameter, with 6 ~ 1, on larger scales 
(Lahav et al. 2002; Verde et al. 2002). Whilst the models 
shown in Fig. 10 appear inconsistent with the measurements 
of Hoekstra et al. (2002), it may be that the simple model of 
scale-dependent bias assumed (equation 16) is not a good de- 
scription of bias on larger scales. Indeed, on very large scales, 
bias is expected to be linear and hence independent of scale. 
On the other hand, Williams & Irwin (1998) suggest galaxies 
are highly anti-biased (b ~ 0.1) even on 5-10/i _1 Mpc scales, 
which would only be consistent with our observations if bias 
has no scale-dependence and is certainly inconsistent with 
the results of other authors (Lahav et al. 2002; Verde et al. 
2002; Hoekstra et al. 2002). 

To attempt to explain strong scale-dependence of bias, 
and, therefore, a higher than expected galaxy-mass cross- 
correlation signal on 100 /i -1 kpc scales, we can consider halo 
occupation models of the galaxy distribution (Berlind & 
Weinberg 2002; Jain et al. 2003). The form of the cross- 
correlation in a ACDM universe can be determined from 
simulations, using halo occupation models to populate dark 
matter haloes with galaxies (Berlind & Weinberg 2002; Jain 
et al. 2003), and taken in ratio with the real-space galaxy 
auto-correlation function measured from the APM galaxy 
catalogue (Baugh 1996), such models typically yield a value 
of b ~ 0.6 on 0T/i _1 Mpc scales. Thus some level of anti- 
bias arises naturally in halo occupation models (albeit not 
as much as observed), going some way towards explaining 
the discrepancy. However, these models are complicated and 
depend strongly on galaxy type, and are thus not well con- 
strained on the scales probed. Whilst a high QSO-galaxy 
cross-correlation signal places a constraint on such models, 
it may be possible to reconcile them with a better under- 
standing of how galaxies populate haloes. 

By comparing the two definitions ([£99 /Cam] and 
[£gg/£mm]°' J ) of b(r) in a range of simulations, Berlind and 
Weinberg (2002) demonstrated that whilst they are not 
in perfect agreement in the scale-dependent bias regime 
(0.1 < r < 4/i _1 Mpc), the two definitions of b(r) have a 
very similar shape over this range. However, given the large 
anti-bias found, and strong scale-dependence of bias implied, 
it is quite likely that they will in fact be considerably differ- 
ent. Therefore, care needs to be taken in interpreting the 
results of this analysis. In particular, we have chosen not 
to follow the analysis of Gaztanaga (2003) in deriving the 
slope of the mass correlation function, 7, and out under the 
assumption that the two bias definitions are equal, as it is 
likely that this would lead to a large systematic error, and 
hence misleading results. 

Finally we should point out that in this work we have 
assumed the weak lensing approximation, /i ~ 1+2k. Takada 
and Hamana (2003) have shown that a non-linear magnifi- 
cation correction will enhance the amplitude of the magni- 
fication correlation by around 10-25 per cent on arcminute 
scales for the QSO-galaxy cross-correlation signal, and such 
a correction would therefore boost our bias estimates by 
this fraction. Whilst in the right direction, such a correc- 
tion is too small to explain the observed discrepancy in a 
ACDM cosmology. Note also that Table 1 indicates that an 
EdS cosmology with a value of 60.1 ~ 1 agrees with the data 
at the 2cr level. Though an EdS framework is consistent with 
the previous QSO lensing results of Croom & Shanks (1999) 
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and Myers et al. (2003), it is inconsistent with the majority 
of recent cosmological data. 



6 CONCLUSIONS 

In this paper, we have studied the cross-correlation of galax- 
ies and background QSOs. Galaxies were drawn from the 
APM Survey in the region of the 2dF QSO Redshift Sur- 
vey Southern Galactic Cap strip and the Sloan Digital Sky 
Survey Early Data Release in the Northern Galactic Cap 
strip. The QSO-galaxy cross-correlation function uj gq sug- 
gests that 2QZ QSOs and galaxies are anti-correlated with 
significance (2.8<r) and strength —0.007, over the angular 
range out to 10 arcmin (1 7i _1 Mpc at the median redshift of 
our galaxy samples). This result is unique in that it is pos- 
sibly the first significant anti-correlation detected between 
QSOs and galaxies that is not subject to small sample sizes 
or problems in selecting the populations. 

Our detection of a galaxy-QSO anti-correlation is con- 
sistent with the predictions of statistical lensing theory. 
When combined with the work of Gaztahaga (2003), a 
consistent picture emerges that spans faint and bright 
QSO samples, which, due to the changing QSO number- 
magnitude count slope, have very different clustering prop- 
erties, relative to the foreground galaxy population. 

We have also considered a number of other possible 
explanations for the anti-correlation between QSOs and 
galaxies. Firstly, errors and the correlation estimator were 
proven robust against Monte Carlo simulations. Care was 
taken to demonstrate that there is no significant correla- 
tion (0.5cr positive correlation) between stars (which were 
selected within the 2dF Survey in the same way as the 
QSO sample) and our galaxy samples. We are thus con- 
fident that the anti-correlation between QSOs and galax- 
ies is not a systematic error. The colours of QSOs around 
galaxies were used to place limits on the effect dust (mod- 
elled with an absorption law appropriate to the Milky Way) 
could have in producing an anti-correlation of QSOs with 
galaxies. Whilst our simple dust models could account for 
at most a third of the observed anti-correlation signal (at 2<r 
significance), we do not rule out the possibility that scatter 
in QSO colours might preferentially discard reddened QSOs 
from the 2QZ, nor do we specifically rule out dust models 
that could obscure QSOs without reddening them, such as 
grey dust, or heavy concentrations of dust in some galaxies. 
No dust model, however, could easily explain the positive 
galaxy-QSO correlations found by Gaztanaga (2003). 

We find that, for a ACDM cosmology, galaxies are 
highly anti-biased on small scales. We consider two models 
that use quite different descriptions of the lensing matter 
and find they yield consistent predictions for the strength 
of galaxy bias on 0.1/i _1 Mpc scales of b ~ 0.1. The in- 
ferred strength of this result is in agreement with the work 
of Gaztanaga (2003). The slope of the scale-dependent bias 
parameter is found to be 76 = —0.24 ± 0.3. The fact that 
many other authors determine much higher values of b ~ 1 
on Megaparsec scales (Verde et al. 2002; Lahav et al. 2002; 
Hoekstra et al. 2002) suggests that bias might be a strong 
function of scale. 

To explain such strong scale-dependence of bias, we can 
consider halo occupation models of the galaxy distribution 



(Berlind & Weinberg 2002; Jain et al. 2003). Such models 
predict some level of anti-bias on 0.1 /i _1 Mpc scales, typi- 
cally yielding a value of b ~ 0.6 (Berlind & Weinberg 2002). 
However, these models are complicated and depend strongly 
on galaxy type, and are thus not well constrained on the 
scales probed. Therefore, it may be possible to reconcile 
them with the observations reported here through a better 
understanding of how galaxies populate haloes. 

An alternative interpretation of these results is that 
they indicate that there is significantly more mass present, 
at least on the 100ft _1 kpc scales probed, than predicted by 
ACDM . Myers et al. (2003) recently detected a strong anti- 
correlation between the same faint 2QZ QSOs and groups 
of galaxies. By applying models of gravitational lensing by 
simple haloes they used this anti-correlation signal to de- 
termine the mass of these lensing galaxy groups, concluding 
that the observed anti-correlation favours considerably more 
mass in groups of galaxies than accounted for in a universe 
with density parameter Q m — 0.3. It is hard to account for 
the statistical lensing properties either of the galaxies pre- 
sented here, or the groups of galaxies from Myers et al., in a 
low mass (Q m ~ 0.3) universe with scale-independent bias. 
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APPENDIX A: GAZTANAGA MODEL 

The model reviewed in this appendix is described in detail 
by Gaztanaga (2003; see also Myers 2003). 

Groth & Peebles (1977) and Phillipps et al. (1978) pro- 
posed the 'e' form of the (three-dimensional) correlation 
function to describe the evolution of clustering with redshift 



(3+e) 



(Al) 



For the purposes of this model we shall assume that e = 0, 
corresponding to the stable clustering regime, in reasonable 
agreement with both current observational evidence and the- 
oretical considerations (Wilson 2003; Carlberg et al. 2000; 
Hamilton et al. 1991; Peacock & Dodds 1994) for the small- 
scale correlations of lensing matter we consider, at the low 
redshifts (z ~ 0.15) where our galaxy samples reside. 

As at the median depth of our galaxy samples (z ~ 
0.15), 1 arcmin corresponds to 0.1 /i _1 Mpc, we choose to 
measure fluctuations on this scale. The volume-averaged in- 
tegral of the mass correlation function, £, over a sphere of 
radius R is (Peebles 1980) 



with 



2 

00.1 



R 



72 



(3-7)(4-7)(6-7)2t V O.lfc^Mpc 



ro 



(A2) 



(A3) 



Gaztanaga (2003) has shown that a good power-law ex- 
pression for scale-dependent biasing of galaxies is 



b(R) = b , 



0.1ft _1 Mpc 



R 



(A4) 



Whilst Gaztanaga assumed that this bias corresponded 
to the ratio of the galaxy and matter correlation functions; 
K r ) ~ [igg{ r )/^mm{r)} | , we are in fact comparing the 
cross-correlation of QSOs and galaxies with the galaxy auto- 
correlation function, and so are defining a bias function via 
b(r) = igg{r)/£, gm {r). Whilst on large scales these two defi- 
nitions should be equal, one expects the halo of the central 



© 0000 RAS, MNRAS 000, 000-000 



On Statistical Lensing and the Anti- Correlation Between 2dF QSOs and Foreground Galaxies 15 



galaxy to affect the shape of £ gm (r) on small scales. There- 
fore we consider a revised version of the Gaztanaga model 
to take this into account. 

Firstly we define the volume-averaged galaxy-mass 
cross-correlation, £ gm (R) via, 



» ,'0.1/i" 1 Mpc\ 
Z grn (R)=b*[ H 



(A5) 



In the case where the two definitions of bias given above 
agree exactly, b* = 60.1 and 7* = 7 6- 

The galaxy-averaged correlation function can thus be 
expressed in terms of the mass-averaged function of Equa- 
tion A2 as 

7 + 7* +7b 

(A6) 



s fm , ,„ 2 ( 0.1 h x Mpc 
€g(K) = 60.16 cr .i 



R 

The two-dimensional projection of the galaxy correla- 
tion function is (again consider Peebles 1980) 

u g M = al 1 b . 1 b*6 1 -''^A ggfi . 1 (A7) 



where 

Aggfl.l = Bo. l(799 

with 

Sh(7) = 7(3 - 7)(4 - 7 )(6 - 7)2 7 r(l/2)r( 7 /2 - 1/2) (AQ) 



)/dx(^) 2 X 1 -^(l + 2 )- (3+E) (A8) 



a-T72 



T(7/2) 



and 7s9 = 7 + 7 * + 76. Note that dN g /dx is the galaxy ra- 
dial selection function as a function of comoving distance, \, 
which is easily derived from the normalised redshift distribu- 
tion of the galaxies (Gaztanaga 2003). We use the selection 
function of Baugh and Efstathiou (1993) to model the red- 
shift distribution of our galaxies. 

In the weak lensing approximation, we can relate the 
fluctuations in projected QSO numbers due to lensing to the 
convergence via (Bartelmann 1995; Bartelmann & Schneider 
2001; Menard & Bartelmann 2002) 



5^(6) = (2.5a - 1)6 fj, = 2(2.5a - l)n eS (6) 



d X e(x)S(0, X )- 



(A10) 
(All) 



where 6(6, x) is the density contrast of the underlying lensing 
matter. The lensing efficiency, e( X ), is defined (Bernardeau, 
van Waerbeke & Mellier 1997) as 



E (X) = (2.5a- l)^!^i(l + z) X 



poo 1 

J x X 



Xd \kl2) 



dx! 



where dN q /dx is the QSO selection function in comoving 
coordinates, which we derive from the redshift distribution 
of (quality '11', z > 0.4) QSOs in the 2QZ sample. 

As outlined in Gaztanaga (2003), these equations allow 
us to derive the galaxy-QSO cross-correlation. 

w ot (0) = <t§.i6'0 1 - 7 "A 1w ,o.i 



where 

Agqfi.l 



£0.1(799) 



, , dN g 



(A13) 



e(x)x 1-7M (l + *r (S+e lA14) 



where 7 99 = 7 + 7*. e(x) is typically less than 1 per cent 
of , which often leads to the prediction that cu gq should 



only be observed to be a tiny fraction of ui gg (Dolag & Bartel- 
mann 1997). 

The slope of the bias correlation function, 7 6 , can be 
easily determined from j gg and j gq . The bias parameter, 
60.1 can be determined via 



60.1 = 



^99 W A,g,0.1 



l -e lb 



(A15) 



If the two definitions of bias given above agree exactly, 
6* = 6 .i and 7* = 7 6 , and one could further derive the slope 
of the mass correlation function, 7, and 00.1 via 



2 _ Ugg(6) 2 Aggfi.l ^ 

ai " "99(e) A^J 



(A16) 
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